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Abstract 

Schur duality decomposes many copies of a quantum state into subspaces labeled by par- 
titions, a decomposition with applications throughout quantum information theory. Here we 
consider applying Schur duality to the problem of distinguishing coset states in the standard 
approach to the hidden subgroup problem. We observe that simply measuring the partition 
(a procedure we call weak Schur sampling) provides very little information about the hidden 
subgroup. Furthermore, we show that under quite general assumptions, even a combination of 
weak Fourier sampling and weak Schur sampling fails to identify the hidden subgroup. We also 
prove tight bounds on how many coset states are required to solve the hidden subgroup prob- 
lem by weak Schur sampling, and we relate this question to a quantum version of the collision 
problem. 



1 Introduction 

The hidden subgroup problem (hsp) is a central challenge for quantum computation. On the 
one hand, many of the known fast quantum algorithms are based on the efhcient solution of 
the abelian hsp |26U27[I151I5U| . On the other hand, the nonabelian hidden subgroup problem 
has potential algorithmic applications: in particular, the graph isomorphism problem can be 
reduced to the hsp in the symmetric group [TnUTT) . and the shortest lattice vector problem can 
be reduced to the hsp in the dihedral group (assuming it is solved by the standard method 
discussed below) 03] ■ Unfortunately, no efficient algorithms are known for these two instances 
of the nonabelian hidden subgroup problem. However, some partial progress has been made: 
there is a subexponential time algorithm for the dihedral HSP |37U45| . and it is known how to 
solve the hsp efficiently for a variety of other nonabelian groups pi|2(J||22!^4 , 30 , 33 , 40 . 

In the hidden subgroup problem for a group G, we are given black-box access to a function 
/ : G — > S", where S is some finite set. We say that / hides a subgroup H < G provided 
/(g) = /(<?') if and only if g~^g' € H (or in other words, provided / is constant on left cosets 
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of iJ in G and distinct on different left cosets). The goal is to determine H (say, in terms of a 
generating set) as quickly as possible. In particular, we say that an algorithm for the hsp in G 
is efficient if it runs in time poly (log \ G\). 

Nearly all quantum algorithms for the hidden subgroup problem make use of the so-called 
standard method^ in which one queries the function / on a uniform superposition of group 
elements and then discards the function value, giving a coset state 

for some unknown, uniformly random g ^ G. In other words, the state is described by the 
density matrix 

Ph:^j^^Y.\9H){9H\ (2) 



geG 



' ' heH 

(sometimes called a hidden subgroup state), where R is the right regular representation of G, 
satisfying 

R{9)\9') = Wg^') (4) 

for all g, g' G G. Now the hidden subgroup problem is reduced to the problem of performing a 
measurement to distinguish the states pn for the various possible H < G. 

The symmetry of the hidden subgroup state can be exploited using Fourier analysis. In 
particular, the group algebra CG decomposes under the left and right multiplication actions of 
G (which commute, and indeed generate each other's commutants) as 



GxG 

creG 



CG ^ 0V.<^V: (5) 



where G denotes a complete set of irreducible representations (or irreps) of G, and Vo- and V* are 
the (row and column, respectively) subspaces acted on by cr € G. The unitary transformation 
that relates the standard basis for CG and the basis for the spaces Va ® V* is the Fourier 
transform, which can be carried out efficiently for most groups of interest pi ll5[l^ l39|. 

Since the hidden subgroup state pn is invariant under the left multiplication action of G, the 
decomposition ((Sj shows that it is block diagonal in the Fourier basis, with blocks labeled by the 
irreps cr e G. For each cr, there is a dim Va x dim Va block that appears dim Va times (or in other 
words, the state is maximally mixed in the row space). Thus, without loss of information (or 
in Kuperberg's words, "sacrificing no entropy"), we can measure the irrep name a and discard 
the information about which cr-isotypic block occurred (sometimes referred to as "discarding 
the row label" ) . 

The process of measuring the irrep name cr is referred to as weak Fourier sampling. If 
G is abelian, then all its irreps are 1-dimensional. In this case, weak Fourier sampling is a 
complete measurement, and all that remains is to use the statistics of the irrep name to infer 
the hidden subgroup. However, for the nonabelian hsp, some (indeed, typically most) irreps 
are more than 1-dimensional, so weak Fourier sampling is an incomplete measurement. Indeed, 
for most nonabelian groups, it turns out that weak Fourier sampling alone produces insufficient 
information to identify the hidden subgroup. In particular, this is true of the hsp relevant to 
graph isomorphism [^ISU] as well as the dihedral hsp. To obtain further information about 
the hidden subgroup, one must perform a refined measurement inside the (dim Vcr)-dimensional 
subspace that remains after weak Fourier sampling gives the result cr. The process of making 
such a refined measurement is referred to as strong Fourier sampling. There are many possible 
ways to measure inside Vo-, especially when it is high-dimensional, and much work has been 
done on the possible measurements for particular nonabelian groups. 
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Of course, with either weak or strong Fourier sampling, a single hidden subgroup state is 
not sufficient to determine H: one must repeat the sampling procedure to obtain statistics. 
However, if one simply repeats strong Fourier sampling a polynomial number of times, there 
are some groups — in particular, the symmetric group — in which hidden subgroups cannot be 
found, even if the measurement is allowed to be chosen adaptively Thus, to solve the 

HSP in general, one must perform a joint measurement on A: = poly(log \G\) copies of the hidden 
subgroup state, pf^ ■ In fact, there are groups — again including the symmetric group — for which 
the measurement must be entangled across r2(log|G|) copies |2n]- Thus the difficulty of the 
general HSP may be attributed at least in part to that fact that highly entangled measurements 
are required (although it should be noted that there are some groups, such as the dihedral 
group, where single-register measurements are at least information-theoretically sufficient |18) . 
yet an efficient algorithm remains elusive). While it is known that 0(log \ G\) copies are always 
information-theoretically sufficient |19| (so that, in particular, the query complexity of the HSP 
is polynomial), there are many groups for which it is not known how to efficiently extract the 
identity of the hidden subgroup. 

Although previous work on the hidden subgroup problem has focused almost exclusively 
on Fourier sampling, it turns out that there is another kind of measurement that can also be 
performed without loss of information. The idea is to exploit the fact that the state pf^ is 
symmetric under permutations of the k registers. Thus, we should consider the decomposition 
of the space (CG)'^'^ afforded by Schur duality [2^1 1 which decomposes k copies of a d-dimensional 
space as 

{C'')'^^''''^'" ^Vx®Qi (6) 

Ahfe 

where the symmetric group Sk acts to permute the k registers and the unitary group tid acts 
identically on each register. Here the subspaces Vx and Q'^ correspond to irreps of Sk and Ud, 
respectively. They are labeled by partitions A of A: (denoted A h fc), i.e., A = (Ai, A2, . . .) where 
Ai > A2 > • • ■ and Xj = k. Note that we can restrict our attention to partitions with at 
most d parts, since dim = if A^+i > 0. 

Since pf^ is invariant under the action of 5^, the decomposition JHJ) shows that it is block 

\G\ 

diagonal in the Schur basis with blocks labeled by A h fc. For each A, there is a dim ' x 
Id 

dim block that appears dim Pa times (or in other words, the state is maximally mixed in the 
permutation space). Thus, no information is lost if we measure the partition A and discard the 
permutation register. By analogy to weak Fourier sampling, we refer to the process of measuring 
A as weak Schur sampling. This is a natural measurement to consider not only because it can 
be performed without loss of information, but also because it is a joint measurement of all k 
registers, and we know that some measurement of this kind is required to solve the general 
HSP. Unfortunately, we will see in Section El (and sec also Corollary 0] below) that weak Schur 
sampling with k — poly (log |G|) provides insufficient information to solve the HSP unless the 
hidden subgroup is very large (in which case the problem is easy, even for a classical computer). 

In fact, since both weak Fourier sampling and weak Schur sampling can be performed without 
loss of information, it is possible to perform both measurements simultaneously (with the caveat 
that one must discard the irrelevant information about the order in which the irreps of G 
appear). Even though the statistics of the irrep name a and the partition A do not provide 
enough information to identify the hidden subgroup, this does not preclude the possibility that 
their joint distribution is more informative. However, we will see in Section |31 that unless we 
are likely to see the same representation more than once under weak Fourier sampling (which is 
typically not the case), the Fourier and Schur distributions are nearly uncorrelated. Formally, 
we have 

Theorem 1 (Failure of weak Fourier-Schur sampling). The probability that weak Fourier- 
Schur sampling ( defined in Section 0j applied to pf^ ( defined in 0) ) provides a result that 
depends on \H\ is at most k'^\H\d'^g^^/\G\, where dmax is the largest dimension of an irrep ofG. 

This implies that k needs to be large for most cases of interest, including the dihedral and 
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symmetric groups. 

Corollary 2 (Weak Fourier-Schur sampling on and (a) Weak Fourier-Schur 

sampling on the dihedral group Vn cannot distinguish the trivial subgroup from a hidden re- 
flection with constant advantage unless k = fl{\/N). (b) Weak Fourier-Schur sampling on the 
symmetric group 5„ or on the wreath product 5„ I Z2 cannot distinguish the trivial subgroup from 
an order 2 subgroup with constant advantage unless k — exp(f2(-\/n)). 

While it is unfortunate that weak Fourier-Schur samphng typicaUy cannot provide enough 
information to find a hidden subgroup, it remains possible that Schur duality could be a helpful 
tool for solving the HSP. Just as weak Fourier sampling breaks the space into smaller subspaces 
in which one can perform strong Fourier sampling (or a multi-register generalization thereof), 
weak Fourier-Schur sampling provides an even finer decomposition into subspaces. Since the 
latter decomposition produces subspaces that are entangled across all k registers, it may be 
especially useful for hsps where we know that entangled measurements are necessary (or for 
which we would like to implement optimal measurements, which are typically entangled |3|3|). 

The proof that weak Schur sampling fails is based on the simple observation that distin- 
guishing the trivial subgroup from a subgroup of order in this way requires one to solve the 
problem of distinguishing 1-to-l from |i/|-to-l functions on G, i.e., the -collision problem 
for a list of size \G\. Since there is an ri( ■^/|G|7|^) quantum lower bound for this problem 
poly(log \G\) registers are insufficient. In fact, the problem resulting from the HSP is potentially 
harder, since the basis in which the collisions occur is inaccessible to the Schur measurement. 
This naturally leads to the notion of a quantum collision problem, and raises the question of how 
quickly it can be solved on a quantum computer. In Section^ we define a sampling version of 
the quantum r-coUision problem and use results on the asymptotics of the Plancherel measure 
on the symmetric group to prove that k — Q{d/r) registers are necessary and sufficient to solve 
it. In particular, we prove 

Theorem 3 (Quantum collision sampling problem). Given p'^^ , distinguishing between 
[case A] p = I/d and [case B[ — p/^ (i.e., p is proportional to a projector of rank d/r) is 
possible with success probability 1 — exp(— 0(fcr/(i))/2. In particular, success probability ^ + $1(1) 
is possible if and only if k = Q(d/r). 

In addition to providing the first results on estimation of the spectrum of a quantum state in 
the regime where k <^ d^, this gives tight estimates of the effectiveness of weak Schur sampling, 
which we see requires an exponentially large (in log |G|) number of copies to be successful. 

Corollary 4 (Failure of Weak Schur sampling). Applying Weak Schur sampling to pfl' 
(where pu is defined in ^) requires k = Q{\G\/r) to distinguish the case \H\ > r from the case 
H = {1} with constant advantage. 

The connection between Theorem 13 and Corollary 21 is explained in Section |21 while Theorem |31 
is proved in Section 2| 

Finally, in Section 14.21 we introduce a black box version of the quantum collision problem. 
We show that it can be solved using 0{y^d/rlog d/r) queries, nearly matching the query lower 
bound from the classical problem (although the best algorithm we are aware of has a running 
time of Oi^yJ djr log d/r)). We also discuss some alternative formulations of the quantum collision 
problem that are less closely connected to the HSP, but that may be of independent interest. 

2 Weak Schur sampling 

We begin by considering only the permutation symmetry of k independent copies of the d- 
dimensional hidden subgroup states, without taking into account symmetry resulting from the 
group G. In other words, we will consider only the Schur decomposition ((HJ, and we will perform 
weak Schur sampling, i.e., a measurement of the partition A. 
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The projector onto the subspace labeled by a particular A h fc is 



„ dim Pa \ ^ ^ ^ ^ 

nA:=^p^ 5] XA(7r)PW (7) 

TreSk 

(see for example 1551 Theorem 8]), where xa is the character of the irrep of Sk labeled by A and 
P is the (reducible) representation of Sk that acts to permute the k registers, i.e., 

-P(7r)|il)|i2) . ■ • \ik) = |i7r-i(l))|«7r-i(2)> • ■ • l*7r-i(fc)) (8) 

for all ii, . . . , ifc G {1, . . . , d}. For any d'^-dimensional density matrix 7, the probability distri- 
bution under weak Schur sampling is 

Pr(A|7) = tr(nA7) ■ (9) 

To use weak Schur sampling as part of a quantum algorithm, it is important that the mea- 
surement of A can be carried out efficiently. The simplest implementation of the complete Schur 
transform 5 , which fully resolves the subspaces V\ and Qf, runs in time poly(fc, d), and thus is 
inefficient when d is exponentially large, as in the hidden subgroup problem. It can be modified 
to run in time poly(fc, log d) either by a relabeling trick footnote in Section 8.1.2] or by using 
so-called generalized phase estimation T,'31' (which may also be viewed as a generalization of the 
well-known swap test 8 , 13 ). Generalized phase estimation only allows us to measure A, but for 
weak Schur sampling this is all we need. In this procedure, we prepare an ancilla register in the 
state J^TTGSk 1""^' Perform a conditional permutation P(7r) on the input state 7, and 

then perform an inverse Fourier transform over Sk [HI on the ancilla register. Measurement of 
the ancilla register will then yield X G Sk, interpreted as a partition of fc, distributed according 
to ®. 

The distribution of A according to weak Schur sampling is invariant under the actions of the 
permutation and unitary groups, since these groups act only within the subspaces V\ and Qf, 
respectively. In other words, for any d'^-dimensional density matrix 7, we have 

Pr(A|C/^'=7C/^^'') = Pr(A|7) yUeUd (10) 
Pr(A|P(^)7P(^-i)) =Pr(A|7) V7re5fe. (11) 

(It is straightforward to verify these invariances by directly computing [Ha, U'^''] = [Ha, Pin)] = 
and using the cyclic property of the trace.) In particular, (|l()|l implies that for 7 = p^'', the 
distribution according to weak Schur sampling depends only on the spectrum of p. (In fact, it 
turns out that simply normalizing A provides a good estimate of the spectrum of p, provided fc 
is large jSE]-) 

Now it is easy to see that weak Schur sampling on fc — poly(log|G|) copies of the hidden 
subgroup state pn provides insufficient information to solve the hidden subgroup problem. The 
hidden subgroup state pn is proportional to a projector of rank IGl/ji?!, as can be seen by 
computing 

Ph-T^ Rihh')^^-^PH. (12) 

Since the distribution of measurement outcomes Pr(A|p'^'^) depends only on the spectrum of 
Ph, and the spectrum of pn depends only on the order of the hidden subgroup, it is clear 
that different subgroups of the same order cannot be distinguished by weak Schur sampling. 
Nevertheless, we might hope to distinguish the trivial hidden subgroup from a hidden subgroup 
of order | > 2, which would be sufRcient to solve the hsp relevant to graph isomorphism, for 
example. However, even this task requires an exponential number of hidden subgroup states, 
which can be seen as follows. 

Suppose that weak Schur sampling could distinguish between hidden subgroup states cor- 
responding to H = {1} and some particular H of order \H\ > 2. Since the distribution of A 



5 



depends only on the spectrum of the state, this would mean that we could distinguish k copies 
of the maximally mixed state /loi/jGI, where Id is the d x d identity matrix, from k copies of 
the state JiGi/i/fi/dGj/l-ff |), where J^' is a projector onto an arbitrary subspace of dimension 
d' . This in turn would imply that we could distinguish 1-to-l functions from |-to-l fmictions 
using k queries of the function. Then the quantum lower bound for the |-collision problem 
shows that k — fl{-\/\G\/\H\) copies are required. 

Of course, this does not mean that 0(-^/|G]7|i?I) copies are sufficient. In fact, it turns out 
that a linear number of copies is both necessary and sufficient, as we will show by a more careful 
analysis in Section^ There we will prove Theorem which by the arguments of this section 
implies Corollary^ 



3 Weak Fourier- Schur sampling 

In the previous section, we showed that weak Schur sampling provides insufficient information 
to efficiently solve the hidden subgroup problem. However, even though weak Fourier sampling 
typically also does not provide enough information, it is conceivable that the joint distribution of 
the two measurements could be substantially more informative. In this section, we will see that 
this is not the case: provided weak Fourier sampling fails, so does weak Fourier-Schur sampling. 

Because neither measurement constitutes a loss of information, it is in principle possible to 
perform both weak Fourier sampling and weak Schur sampling simultaneously. If we perform 
weak Fourier sampling in the usual way, measuring the irrep label for each register, then we 
will typically obtain a state that is no longer permutation invariant. However, since the irrep 
labels are identically distributed for each register, the order in which the irreps appear carries 
no information. Only the type of the irreps, i.e., the number of times each irrep appears, is 
important. Thus, it is sufficient to perform what we might call weak Fourier type sampling, in 
which we only measure the irrep type. Equivalently, we could perform complete weak Fourier 
sampling and then randomly permute the k registers, thereby discarding the (irrelevant) in- 
formation about the order of the irreps, and producing a permutation- invariant state to use 
for weak Schur sampling. (In fact, since the distribution under weak Schur sampling has the 
permutation invariance Hll(l . it is not actually necessary to explicitly symmetrize the state.) 

We begin by performing weak Fourier sampling. The hidden subgroup state pn defined in 
^ has the following block structure in the Fourier basis: 

Here the probability of observing the irrep a is 

' ' heH 

and the state conditioned on this observation is pH,a- Weak Fourier sampling consists of mea- 
suring the label of the irrep. Assume we have observed a. Then the resulting density operator 
is 

PH,.= ^ ^ ^ |a)(a|0a(M*. (16) 
l^henXaW 

We explicitly include the label \a) to reflect the fact that a was observed. 
Repeating weak Fourier sampling k times, we obtain the state 

PH,a = PH,cn (Xl • • • (81 PH,ak , (17) 
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where a_ :— (cti, (T2, ■ ■ ■ ,(Jk) S & may be viewed either as the actual outcome of the k instances 
of weak Fourier sampHng, or merely as a representative of the irrep type, as discussed above. 
Given this state, the conditional probability of observing the partition A is 

Pr(A|a) = tr(nA (18) 
^ dimP, ^ XAWtr[FWpHj. (19) 

Note that tr[P(7r) pH,a] = if 7r(a;) ^ a, where Tr{a) = {cr.^-i(i)j . . . , o'^-i(fe))- 
Now we are ready to prove Theorem ^ 

Proof of Theorem^ Let us assume that a is multiplicity-free, i.e., that all the at's are different. 
In this case the traces are zero for all tt 7^ 1, where 1 is the identity element of Sk- Then 

„ , dhnVx , , (dimPi)^ 

Pr(A|£) = — ^ xa(1) trpH,. - ^ ^ , (20) 

which is nothing but the Plancherel distribution over 5^ , and which in particular is independent 
of the hidden subgroup H . This shows that we cannot extract any information about iJ provided 
that we have obtained a multiplicity-free a_. 

It remains to show that the probability of obtaining a type q_ that is not multiplicity-free is 
vanishingiy small. This can be seen as follows. From H15(l . we have 

Pr(a)<(^^|i/| (21) 

since the absolute value of the character Xa at any group element can be at most dimVo-. If we 
perform weak Fourier sampling on two copies of then the probability of obtaining the same 
irrep twice is 

Pr(2 irreps same) = V Vxiaf < maxPr(cr) < -r^\H\ , (22) 

o-SG 

where dmax '■— ^^^aeG dim Vo- denotes the maximal dimension of any irrep of G. For k copies, 
we can upper bound the probability that any two irreps out of the k observed irreps are the 
same by a union bound. We find 

Pr(any 2 irreps same) < Pr(2 irreps same) < ^^j^ \H\ . (23) 

Therefore, provided \H\ and dmax are not too large, the probability of obtaining the same irrep 
twice is small. This concludes the proof of Theorem ^ □ 

We now focus on the important special cases of the dihedral and symmetric groups to prove 
Corollary 13 

Proof of Corollary\^ When G is the dihedral group Vn of order 2N, we have dmax — 2, so H23(l 
shows that weak Fourier-Schur sampling cannot distinguish the trivial subgroup from a hidden 
reflection (a subgroup of order 2) with constant advantage unless k = ^{^/N). Similarly, for 
the symmetric group G = 5„, we have dmax < -s/nl exp(— c-^n) for some constant c 53 , which 
implies that weak Fourier-Schur sampling cannot solve the hsp relevant to graph isomorphism 
unless k — exp(r2(-yn)). Finally, we can express the irreps of the wreath product Snl^2 in terms 
of the irreps of 5„ 03 Section 8.2], implying that d^ax < 2n! exp(— 2c-\/n). Since |5„ I Z2I = 
2(n!)^, we again have exponentially small distinguishability unless k = exp{Q{y^)). □ 
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In | 14| two of us considered an alternative quantum approach to the graph isomorphism 
problem based on the nonabelian hidden shift problem. It can be shown that weak Fourier-Schur 
sampling fails for similar reasons when applied to hidden shift states instead of hidden subgroup 
states. The hidden shift states have a similar block structure, where the blocks correspond to 
irreps of G (although in the hidden shift case, the distribution of a is exactly Plancherel over G, 
so weak Fourier sampling trivially fails). Again, the probability of seeing the same irrep twice 
is exponentially small, and unless some irrep appears twice, the conditional distribution of A is 
equal to the Plancherel distribution over Sk ■ 

4 The quantum collision problem 

In Section [3 we saw that weak Schur sampling cannot efficiently solve the hidden subgroup 
problem since such a solution would imply an efhcient solution of the collision problem. In fact, 
the problem faced by weak Schur sampling is considerably more difficult, since no information is 
available about the basis in which the collisions occur. This motivates quantum generalizations 
of the usual (i.e., classical) collision problem, which we study in this section. 

The classical r-collision problem is the problem of determining whether a black box function 
with d inputs (such that r divides d) is either 1-to-l or r-to-1. This problem has classical 
(randomized) query complexity Q{yjd/r) — as evidenced by the well-known birthday problem — 
and quantum query complexity d/r) [Tl lllj . The classical algorithm is quite simple: after 
querying the function on 0{^J d/r) randomly chosen inputs, there is a reasonable probability 
that a collision will appear, provided one exists. The quantum algorithm is slightly more subtle, 
making use of Grover's algorithm for unstructured search j2Hl- In particular, while the classical 
algorithm queries the black box non-adaptively, it is essential for the quantum algorithm to 
make adaptive queries. 

We begin in Section f4.1l bv considering what one might call the quantum r-collision sampling 
problem, which is simply the problem of deciding whether one has k copies of the rf-dimensional 
maximally mixed state or of a state that is maximally mixed on an unknown subspace of 
dimension d/r. This is exactly the problem faced by the weak Schur sampling approach to 
the hidden subgroup problem, so our results on the quantum collision sampling problem allow 
us to put tight bounds on the effectiveness of weak Schur sampling for the hsp. It turns 
out that k = <d{d/r) copies are necessary and sufficient to distinguish these two cases with 
constant advantage. Interestingly, this measurement is entangled across all k registers, and 
we do not know whether its performance can be matched by an adaptive sequence of single- 
register measurements, as is the case for asymptotically large k However, we show that 
non-adaptively measuring k sets of 0(1) registers each is much less effective, succeeding only if 
fc = e(dVr2). 

A complete definition of the quantum collision problem requires us to specify a unitary 
black box that hides the function, and that allows us to make adaptive queries. We propose one 
such definition in Section l4.2l and show that the resulting quantum r-collision problem can be 
solved in 0{^/dJr log d/r) queries, nearly matching the VL{^d/r) lower bound from the classical 
collision problem. We also mention some possible alternative formulations of the problem and 
comment on their complexities. 

4.1 The quantum collision sampling problem 

This section is devoted to the proof of Theorem 13 We consider the sampling version of the 
quantum collision problem, in which we are given k samples of a quantum state that is either 
maximally mixed on a Hilbert space of dimension d, or is maximally mixed on an unknown (but 
fixed) subspace of dimension d/r. We begin by showing the equivalence of this problem with 
some related problems. 
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Lemma 5. The following tasks are equivalent: 
(a) Distinguishing [case A] (I/d)^^ from [case B] 

(d/r \ '^'^ 

jy\]){3\U^ (24) 

where AU is the Haar measure on the d- dimensional unitary group lAd- 

(h) Distinguishing [case A] [I/d)®'^ from [case B] p®^ for arbitrary p satisfying p^ — p/j 
(i.e., proportional to a projector of rank d/r). 

(c) Distinguishing [case A] pf^ from [case B] pf'', where pi, P2 o,Te density matrices on , 
d' >d,pl= pi/d and pi = pa/^. 

Moreover, the optimal solution to each problem is to perform weak Schur sampling (i.e., the 
measurement with operators {Tl\}\\-k defined in ^) and then to perform classical processing 
on the measurement outcome. Both the quantum measurement and the classical processing can 
be performed in time poly(A:, logd). 

Proof. First note that (a) reduces to (b), which in turn reduces to (c). Thus equivalence of all 
three will follow from reducing (c) to (a). To do this, we will demonstrate an optimal strategy 
for (a) and show that it performs equally well when applied to (c) . 

Both states in (a) are invariant under the actions of Sk and Ud, so they can each be written 
in the form ^;j^Pr(A|7)nA/trnA, where 7 is the state appearing either in case A or case B, the 

are defined according to and Pr(A|7) = trnA7. Thus weak Schur sampling (followed by 
classical processing of the outcome A) is an optimal measurement strategy for problem (a). 

Now observe from O that Il\ can be defined as a weighted sum of permutations, and thus 
tillxp®'' depends on p only via the quantities tr p, tr p^, . . . ,trp'^ that appear when evaluating 
P{'k)p'®^ for tt S St- This means that Pr(A|7) is unchanged if we either embed (C'')®'^ into 
((^d'-)C5fc fQj. ^/ > i£ replace p®^ with [UpU^)®^, or indeed with E,7(C/p[/t)®'= for any 

distribution over unitaries U . This implies that Pr(A|7) is the same for tasks (a), (b) and (c), 
and that any solution for (a) implies an equally effective solution for (c). 

Finally, the measurement {nA}Ai-/c (i-c., weak Schur sampling) can be efficiently implemented 
using generalized phase estimation, as described in Section |21 The classical processing consists 
of comparing the conditional probabilities trllAp'^'^, for which we give efhciently calculable 
formulas below. □ 

Now we turn to the calculation of Pr(A) = trllAyo'^'^ for p satisfying p^ — p/d (and embedded 
into an ambient space of arbitrary dimension d' > d). We call the resulting distribution of Ah fc 
the Schur distribution, Schur(fc,(i), in which A appears with probability 

Pr(A) ^ ^JllEl^ (25) 

^ (dimT^A)' 
k\ 

(■'j)eA 

where in the second line we have used Stanley's formula for dim 51 , interpreting A as a 
Young diagram, where (?, j) G A iff 1 < j < Xi. 

Our goal is to show that the distributions Schur(fc, d) and Schur(fc, d/r) are close unless k 
is il{d/r). We will do this by comparing the Schur distribution to the Plancherel distribution 
over X G Sk, Planch(fc), for which 

P„A,.<i2Lp>!. ,27) 

Note that we can think of Planch(fc) as the d ^ 00 limit of Schur(fc, d). 

The key technical part of the proof of Theorem |3| is the following lemma: 



n (26) 
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Lemma 6. If2<k<d then 

k k 
— < II Schur(fc,d) - Planch(fc)||i < V2- . (28) 
36a a 

We note in passing that the upper bound can be amphfied when k > d as follows: 

Corollary 7. 

i|| Schur(fc,d) - Planch(fc)||i > 1 - e-To^esd-i) . (29) 

Proof of Corollary^ For density matrices p,a, we define the fidelity F{p,a) := \\^/p\/^\\l and 
recall the relations j^I] 

i-/f(p^< ^IIp-^tIIi < (30) 

where the second inequality is saturated when p and a are pure states. Now use Lemma to 
obtain 

a/I - F(Schur(d, d), Planch(d)) > i|| Schur(rf,rf) - Planch(d)||i > ^ . (31) 



F(Schur(fc, d), Planch(fc)) < F(Schur((i, d), Planch(d)) J (32) 

<(l " <e"iT^(5-i), (33) 



Next, 



5184, 

using first that fidelity is nondecreasing under quantum operations and is multiplicative under 
tensor products, then l|ST|l . and then the inequality 1 — x < . Finally, 

i|| Schur(fc,d) -Planch(fc)||i > 1 - v/-F(Schur(A:, d), Planch(A:)) > 1 - e^Towlz-i) ^ (34) 

which completes the proof. □ 

To derive Theorem O from Lemma El we need one additional fact. 
Lemma 8 (Monotonicity). For any integers k,di,d2 > 1 and r>2, 

||Schur(fc,rfi) - Schur(fc,rf2)|li > ||Schur(fc, rdi) - Schur(/c, rd2)|| 1 • (35) 
Proof. Define ad (for d one of di, c?2, ^di, rd2) as the state 

jy\j){j\uA (36) 

where AU is the Haar measure on the d'-dimensional unitary group hid' and d' is some arbitrary 
integer satisfying d' > r ■ max(di, ^2)- Let J\f be the quantum operation which takes as input a 
state on (C^ )®'^, adjoins the maximally mixed state (/r/r)®*^, draws a uniform U G Urd', applies 
C/®'^, and finally outputs a state on (C"* )^'^. Adjoining Ir/r will map a state proportional to 
a rank d projector to a state proportional to a rank rd projector. Thus, N{(Jd) = cr,d (up to a 
change in the ambient dimension, which is irrelevant by Lemma [SJ. Then (|35|) follows from the 
non-increase of trace distance under quantum operations. □ 

(It seems plausible that Lemma |H1 should also hold for non-integer r whenever rdi and rd2 are 
integers, but we know of no proof of this claim.) 
Now we turn to the proof of Theorem |3| 
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Proof of Theorem\^ The case when k < d is the simple one. Using first the triangle inequality, 
and then Lemma we find 



Schur(fc, d) — Schur(fc, d/r)\\ ^ 

< ||Schur(/c,d) -Planch(/c)||i + ||Schur(/c, d/r) 

< ^tL^ < 0{kr/d) . 



Planch(fc)||i 



(37) 
(38) 



This establishes the il{d/r) lower bound on the number of copies. 

Now, for any choice of d and r, we would like to show that k = d/r copies is sufficient to 
obtain || Schur(fc, d) — Schur(A;, d/r)|| i > The achievability part of Theorem 01 will then 

follow from the same sort of amplification argument we used in the proof of CoroUaryd Choose 
v to be the smallest positive integer satisfying r'^ > 72\/2; note that v <7. Then we use both 
the upper and lower bounds of Lemma to find 



Schur ( — , — 
r r 



Schur I — , dr" 

r 



Planch I — 
r 

Planch ( — 



> — 
- 36 

V2 1 
< — < — 



(39) 



(40) 



Next we abbreviate Sk,d Schur(A;, d) and P^ 
bound 



Planch(fc) and use the triangle inequality to 



S d d S d 



,d 



Sd 



S d 



.dr 



S d 



S d 



(41) 



> 



S d d — P d 



S d 



-1 - p. 



> — 

1 - 72 



According to Lemma |S| the first term on the left hand side of H41|) is the largest. Thus, 



Schur 



d d 



Schur 



1 1 
-72^-504 



This concludes the proof of Theorem |3| 



(42) 
□ 



Our only remaining task is to prove Lemma |S1 establishing tight bounds on the distance be- 
tween Schur(A:, d) and Planch(A;). Here our primary tools are explicit formulas for the Plancherel 
averages of certain functions of Young diagrams over Sk ■ 

Proof of Lemma\^ Let Ak^d denote the total variation distance between Schur(A:, d) and Planch(fc). 
According to and (P7|l . A^^^ := || Schur ( fc, d) — Planch(fc)||i can be written 



Ak.d 



E 

Ahfc 



n 



(43) 



where the expectation is taken over Planch(fc). 

We will now derive both upper and lower bounds on A^.d from (|43|) . starting with the 0{k/d) 
upper bound. By the Cauchy-Schwarz inequality (or concavity of the square root and Jensen's 
inequality), 

2 



Ahfc 



E 

Ahfc 



n '1 

(«j)eA 



n 

(*j)eA 



- 1 



2 n 



J - « 



E n 



Ahfc 



- 1. 



(44) 

(45) 
(46) 
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where in the last hne we have used hnearity of expectation and the fact that Schur(fc, d) is 
normahzed. Now using the inequality 1 + x < e^, we have 



Al,< E^exp|2 ^ 

(ij)6A 



(47) 



^ m\ (P^ Ahfc 

m — 1 



E «i(A)" 



(48) 



where 

«i(A):= E (j'-*) (49) 

can be viewed as a measure of the asymmetry of A. 

Intuitively, we expect this to be small since typical Plancherel-distributed Young diagrams 
are close to a certain limiting shape that is symmetric, and in which the length of the longest row, 
Ai, is close to |^{8ll,'^2j (in fact, Ai is exponentially tightly concentrated around this value 
|Zj). Our proof will use results obtained by Kerov in the course of describing the asymptotically 
Gaussian fluctuations about the limiting shape .'!•") . 

It turns out that vi can be expressed exactly in terms of a central character of the symmetric 
group [331, namely 



(50) 



^2 J dimVx 

where r is any representative of the conjugacy class of transpositions. Then using the results of 
Kerov |35[ equation (2.5) and Lemma 2.3], we find 

E wi(A)2™-i 



Ahfc 



= 



Ahfe ' 4™ to! (fc - 2m)! 
for TO = 1, 2, . . .. Using these expressions in (|48ll . we find 

2m 



< 



(2TO)!A:2'i 
4™ to! 



V2 



oo 



1 



711 — 1 



TO 



1 < 2 



(51) 
(52) 

(53) 



where in the second inequality we have assumed k < d. This shows that Ak^d — o(l) provided 
k = o(d), and indeed, that A^^^ is exponentially small (in logo?) if fc = d^~'^ for any fixed e > 0. 

Now we establish the Vl{k/d) lower bound for || Planch(fc) — Schur(fc, d)||i, again by a careful 
examination of the distribution of wi(A). If S is any subset of the partitions of k, then 



Afc,d> E 



Ahfc 



1 n '1 

(»j)eA 



] 



6{\ e S) 



Pr(Ae^)- E^ n (l 



(*,i)6A 

> Pr(A e 5) - E e"i(^)/'^(5(A e S) 

Ahfc 



> Pr(A e S) ( l-niaxe"i(^)/'' 



6{\ e S) 



(54) 

(55) 
(56) 
(57) 



Thus, if we can show that wi(A) has a reasonable probability of being very negative, we will find 
that Afc^d is appreciable. Under the Plancherel distribution, the probability of a diagram A and 
its transposed diagram A"^ (satisfying ui(A"^) — —vi{\)) are equal, so by symmetry, 



Pr(«i(A)<-«) = ipr(t;i(A)2>«2). 



(58) 
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Letting S be the set of A's with i'i(A) < —v (for some w > to be specified later), we have 



Now we use H52|) for m = 1 and m ~ 2, i.e., 



E i;i(A)" 



E vAXr = -k(k -l)(k- 2)(k ~ 3) 

Xhk 4 



to show that 



E 1^1 (A)* - 



E vAXY 

Xhk 



1 k'^ 
= -k{k - l)(2fc2 _ I5fc + 18) < — 



Applying a one-sided variant of the Chebyshev inequality, namely j47| 



Pr(X > /i - acr) > 



a 



1 + a2 



where EX and a"^ := EX"^ — fi"^, we find 

Pr (-i(A)^>(2 



- > 



y/2j ~ l + a^ 

Choosing (say) a — we have Pi{vi{X)'^ > jk'^ — \k) > i; plugging this into l(5^ gives 

k 



> 



18 



1 — exp 



2d 



k 



(59) 

(60) 
(61) 

(62) 
(63) 
(64) 

(65) 



which is r2(l) provided k = 0{d). To obtain a specific numeric bound, we divide into cases. If 
2 < A: < 5, then Ak^d > A2,d — l/d> k/36d, where the equality can be verified by substituting 
k = 2 into (gSll. If 6 < A: < d, then i(l - f )i/2 > so Ak,d > i^(l - 6"'=/^''). Next, the 
convexity of e"^/^ implies that e"^/^ > 1 - xe^^/^ > 1 - x/2. Thus A^^d > fc/36d. □ 

To put the results of this section in context, we review the situation when fc 3> rf or fc <C d. 
When k ^ oo with d held constant, applying the measurement {IlAjAi-fc to p^'^ and outputting 
A := A/fc has long been known to be a valid estimator of the spectrum of p 36 . Indeed, if 
ri > . . . > Td are the eigenvalues of p, then |16II32| proved that 



trHAp''^ < (k + if'^'^-^^/^ e^p {^kD{X\\r)) 



(66) 



where D{p\\q) := ^iPi^og{pi/qi) is the (classical) relative entropy. In fact, this coefRcient of 
k in the exponential is optimal as A: ^ oo. This inequality is usually only interesting when 
k = f2((i^), so our Theorem |31 can be viewed as the first positive result for spectrum estimation 
in the regime where k = o{d^). 

What about the other limit, when k ^ d7 The only constructive result previously known 
was the swap test, which yields A2,d = 1/rf- However, 



F(Schur(2,d),Planch(2)) 



1 - O (1/d^ 



(67) 



so repeating swap tests will actually require k — fl{d^) copies before achieving distin- 
guishability. Indeed, k — ^{d"^) copies are needed for distinguishability even if we perform 
the optimal individual measurement on 0(1) copies at a time. This follows from the following 
variant of Lemma 
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Lemma 9. 



F (Schur(/c, d), Planch(fc)) > 1 - 



12d2 



C 



-\/fe 



4d2 ' 



(68) 
(69) 



F(Schur(A:,d),Planch(fc)) > 1 

where C > \ is a universal constant. 

These bounds are not quite strong enough to match the 1 — 0{k^ /cP) upper bound on the 
fideHty from Lemma |^ (which we conjecture is tight), but they still prove that achieving the 
performance of the optimal measurement requires operations that are entangled across many 
copies. 



Proof. We begin with a direct calculation of 

/^_^ := i^(Schur(fc,d),Planch(fc)) = E 



Ah/c 



\ 



n 

(»j)eA 



] - * 



(70) 



Consider the contribution from a particular A, and suppose for now that A = A"'". Then (i, j) € A 
if and only if (j, i) e A, so each factor of 1 + [j — i)/d is accompanied by a corresponding factor 
of 1 — (j — i)ld (unless i = j, in which case this term is 1 and can be ignored). Combining these 
terms, the entire product can be written as 



. n 



3 - ' 



n 



1 - 



(.7 - if 
d^ 



(71) 



Otherwise, suppose that X . Recall that A and A"^ have the same probability under the 

Plancherel distribution. Thus the terms corresponding to A and A"'" appear in (|70() with the 
same coefficient. Next we use the identity x + y > 2y^xy (for a;, y > 0) to show 



n 



J 



> 2 



n 



1 - 



d2 



(72) 



Combining H71|l and ((72|l . we can lower bound ((70|l as 



fk,d> E 



n 



d^ 



> 1 



fc3 
12d2 



(73) 



where the last inequality uses the fact that the product is minimized for the diagram A = (fc). 
This proves 

The proof of l|69() is mostly the same, but also uses bounds on large deviations of Ai for 
Plancherel-distributed A. In particular, equation (1.20) of 7 implies that under Planch(/c), 

Pr(Ai > 3\/fc) < (74) 

for a universal constant C > 1 (in fact, C — exp(6cosh^^(3/2) + 2-\/5) ~ 28189). Since {i,j) G A 
implies that ^ — i < Ai — 1, we can lower bound the middle expression in (|73|l to give 



fk,d > Pr(Ai < 3Vk) ( 1 - 



9k 



> 1-C 



9P 
4d2 ' 



which completes the proof. 



(75) 
□ 



It is interesting to consider the classical analog of applying the swap test to the collision 
problem. If we compare fc/2 sets of 2 objects each, then estimating the number of collisions 
up to a constant multiple requires k = Q{d). This is also quadratically worse than the optimal 
collective estimation procedure on k objects, but for apparently different reasons. 
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4.2 A black box for the quantum collision problem 

We now define a version of this problem that allows for adaptive algorithms by introducing a 
quantum oracle. The oracle implements the isometry 

N>-K>lV'/w), (76) 

where B := , . . . , \ il'd)} is an arbitrary (unknown) orthonormal basis of and / is either a 
one-to-one function or an r-to-one function. The goal is to determine which is the case using as 
few queries as possible. In fact, we assume that the isometry is extended to a unitary operator 
R acting on <Si C by 

\t)\y)^\t)U\y(B.fii)), (77) 

where U :— |V'i)(*l is the unitary matrix effecting a transformation from the standard basis 
to B. We also assume we can perform its inverse i?^. 

By considering the case where the basis B (or equivalently U) is known, it is clear that the 
quantum lower bound for the usual collision problem J. implies an Q,{^/dJr) lower bound on 
the number of queries for the quantum collision problem as well. We present an algorithm for 
this problem that requires only O {^/djr log d/r) queries, nearly matching the lower bound. 

The basic tool used in this algorithm is the following amplified version of the swap test: 

Lemma 10 (Amplified swap test with small disturbance). Suppose we are given a quan- 
tum state p satisfying 

\\\p-\^)m,<^ (78) 

where 

|V)=5]a.|a.)«™|A)«'"|7.) (79) 

i 

with (7i|7j) — 5i,j and ja^p — 1. We further assume that for each i, 9i := {ai\l3i) is equal 
to either or 1. Then there exists a unitary operation U , implementahle with 0{m) gates, such 
that 

i||C/pC/t-|^')(^'l||i<e + 2^-"/^ (80) 

where 

W) ■.= Y,a,{-lt\a.r^\l3,n..) . (81) 

i 

Proof. The procedure for implementing U is as follows. First perform the m-fold swap test 
without the final measurements. Conditioned on any swap test showing that the states are 
orthogonal, record this answer in an ancilla qubit; otherwise, record that they are the same. 
Apply a phase of —1 in the "same" case. Finally, uncompute the ancilla qubit along with all 
the swap tests. 

To analyze J7, we initially assume we are acting on Let 11 denote the projector that 

results from each swap test returning the symmetric outcome, so that J7 = 1 — 211. Since the 
swap test will always project |ai)|ai) onto the symmetric space, we have 

n|a.)«™|A)®"|7.) = I«.)^"|ft)^'"l7^) (82) 
whenever 6*^ = 1. On the other hand, li Oi = then 

n|a,)«"|A)«™|7.) = -^Wn.) (83) 
V 2 

where \'qi) := {\ai)\f3i) + \f]i)\ai)). Combining these, we obtain 

U\i^) = E °« ((-!)'' - '5e.,o2^-^ h.)^") |7.) (84) 

i 

= \r^')-J2a,Se,.o2'-'^hr"'\l^). (85) 
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Now the fidelity between U\ip) and is 



F {\i,'){^'\,u\^){m^) = mum 



> |l - 2i""|^ > 1 - 2 



(86) 

2-m ^37) 



Finally, 



2 

<l\\p-wmi + 2 



(89) 



where H88|l uses the triangle inequality and H89|l uses first the unitary invariance of the trace 
norm and then ijHZI) and □ 

We will use this amplified swap test to prove 
Theorem 11. The quantum r-collision problem has query complexity 0{ \/ d/r log d/r). 

Proof. We give an explicit algorithm based on the quantum algorithm for the usual collision 
problem ^J. We first outline that original algorithm and then show how to extend it to the 
quantum problem. 

The algorithm builds a table of the first \fdfr items and uses Grover's algorithm to search 
the remaining items for an entry of the table. Set if := {1, . . . , \fdfr^ and K :— \\^ . . . — K . 
Then the algorithm consists of two steps: 

1. Build the tabic L := {{i, f{i)) -.ieK}. 

2. Use Grover search to find j £ K such that F{j) = 1, where the function F : K ^ {0,1} 
is defined by F{j) = 1 if there is i S if with /(«) — f{j) and F{j) = otherwise. 

The first step uses 0{^fdjr) queries. In the second step, the function F must be computed 
reversibly, so two queries {R and R)) are used for each call to F. Note that it is important that 
F can be computed on a superposition of inputs. The total number of queries in the second step 
is twice the number of Grover iterations needed to find a marked item, which we now analyze. 

Assume that / is an r-to-one function. The probability that there is any collision among 
the elements of K is only 0[[r / d)^/^), implying that the number of j & K with F{j) = 1 is 
(r — l){d/ry^^ with high probability. Therefore, the number of Grover iterations necessary to 
find a solution is 0( v/rf/[r(d/r)i/3]) 0((d/r)i/3). in summary, the total number of queries of 
the above algorithm is 0((d/r)^/'^). 

Now we adapt this algorithm to the quantum collision problem. Some adaptation is necessary 
because it is not possible to directly check equality of two quantum states \ipj) and 
However, using Lemma 11(11 we can effectively test equality using m := 2 + 2 log d/r copies of the 
quantum states, increasing the query complexity only by a factor of 0{\ogd/r). 

For this to work, it is important that we can reuse the states corresponding to the entries 
in the table. Iterating Lemma IIUI we find that the error after £ Grover iterations is at most 
i ■ 2^"™/^ < ir/d. Since the number of Grover iterations is ^ = 0{{d/ry^^), the total error 
is asymptotically negligible, and we obtain essentially the same performance as in the classical 
collision problem. □ 

So far, we have only considered the query complexity of the quantum collision problem, 
without worrying about the algorithm's running time. The quantum algorithm for the classical 
collision problem can run in time 0(-^c?/r log d/r) (i.e., only logarithmically greater than the 
query complexity), since after sorting the table L (in time 0( -^d/r logd/r)), the function F can 
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be computed in time 0(log d/r) For the quantum collision problem, there is no obvious way 
to preprocess the table of quantum states. However, by using Grover's algorithm to implement F 



in time 0{\/ {d/rY^^ log d/r), we can obtain a running time of 0{y^ d/r log d/r) — substantially 

greater than the query complexity, but better than implementing F by brute force search. Note 
that to achieve these running times (for the both classical and quantum collision problems), we 
must assume a quantum RAM model. In other words, we must assume that it is possible to 
access any desired location in the table — or any superposition thereof — in constant time (see for 
example 1,42!, Section 6.5]). If we restrict ourselves to the quantum circuit model, not allowing 
parallelization, then the running time is even longer for both problems. 

We conclude this section by briefly considering some alternate versions of the quantum 
collision problem, one that has a fairly simple solution, and two about which less is known. 

• Input in an unknown basis: Our definition of the black box for the quantum collision 
problem in (j77|l gives the output in an unknown basis. Alternatively, we can consider a 
problem where the input is taken in an unknown basis. Here we define the oracle R to act 
as R [U\i) (S) \y)] — U\i) ($\y (B /(«)), where U is again an unknown unitary and / is either 
one-to-one or r-to-one, and the problem is to determine which is the case. This problem 
was suggested by Scott Aaronson; the solution we give here was also found independently 
by Andris Ambainis. 

The algorithm first queries R on (d/r)^/^ basis states of the form \j) (g) |0) for uniformly 
random j, and then measures the second register, producing a table L of uniformly random 
function values /(«), which are all distinct with high probability. Then we perform a Grover 
search for a state U\i) such that f{i) G L. There are two components to the Grover search. 
First, we need to identify our target space. Let H :— J2ief-^{L) ^ ■ Note that trH 

is either (d/r)^/'^ or r{d/r)^^^ , and that we can perform the reflection 1 — 2H using only 
two queries (one of R and one of R^). This implementation can be done by brute force 
in time 0{r{d/r)^/'^), or by Grover search in time 0{^/r{d/ry7^) — either way, regardless 
of whether / is one-to-one or r-to-one, and using only two queries. Next, we need a 
starting state \tjj) that we can efficiently map to and from the |0) state, so that we can 
both initialize it and perform the reflection 1 — 2 \ip){ijj\. One such is a state that is 
maximally entangled between the first register and an ancilla register of equal dimension. 
Then we have U)]!/;) = tr H/d. 

The algorithm now alternates £ times between the operations 1 ~ 2 {^|;\ and 1 — 2n. We 
choose £ so that if / is r-to-one then the probability of finding a solution is nearly 1, while 
if / is one-to-one then the probability is far from one. Thus, we will be able to distinguish 
the two cases with constant advantage. Following jl2|, we set 



The probability of finding a solution is > 1 — 0(r^/'^/d^/'^) if / is r-to-one and is < 
1/r -I- 0(r^/^/d^/^) if / is one-to-one. 

If instead both the input and output bases are unknown, then we know of no algorithm bet- 
ter than making 0{d) queries with random inputs and using the algorithm of Section |4. II 

• No access to R^ : Suppose that instead of the full unitary oracle H77(l . R is simply the 
isometry H76|l . mapping \i) to In this case, the best algorithm we know queries 
the oracle 0(logd) times on each of Vd inputs (so that there is a collision among those 
inputs with reasonably high probability), and then searches the resulting table (either by 
brute force or by Grover search; in either case, making no further queries) using Lemma ITUl 
to perform comparisons. We conjecture that this is optimal (up to the log factor), but we 
know of no lower bound better than the Q{d^^^) bound from the classical problem. 

• Black box rank estimation: Suppose again that R is an isometry, but now mapping \i) 
to \i)\ipi), and we are only promised that either forms a basis for or that 





(90) 
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3 Si is proportional to a projector onto a ((i/r)-dimensional subspace. Here the 

best algorithm we know is to query R on 0{d) inputs and use the rank estimation proce- 
dure from Section E!n We conjecture that this is optimal, but again have only the lower 
bound of 17(^1/3). 

5 Discussion 

In this article, we have shown that weak Fouricr-Schur sampling typically provides insufficient 
information to solve the hidden subgroup problem. Nevertheless, it remains possible that Schur 
duality could be a useful tool for the hsp. Just as weak Fourier sampling can always be used to 
refine the state space into smaller subspaces, even when it alone fails to solve the hsp, so we can 
use weak Fourier-Schur sampling to decompose the space even further. The Schur decomposition 
has the additional complication that the refined subspaces are no longer simply tensor products 
of single-copy subspaces, but this may actually be an advantage since entangled measurements 
are known to be necessary for some groups. 

In principle, strong Fourier-Schur sampling is guaranteed to provide enough information 
to solve the hsp, simply because the hidden subgroup states are always distinguishable with 
k = poly(log|G|) copies. However, it would be interesting to find a new efficient quantum 
algorithm for some hsp based on strong Fourier-Schur sampling. Perhaps a first step in this 
direction would be to analyze the performance of measurement in a random basis, as has been 
studied extensively in the case of weak Fourier sampling |24II4(J[I1^I48^ . 

Moving away from our original motivation of the hidden subgroup problem, the quantum 
collision problem may be of independent interest. As discussed in the latter part of Section [4. II 
our results on the quantum collision sampling problem can be viewed as an exploration of 
spectrum estimation with k = o{cP) copies, but much remains unknown about that regime. 
Many open problems also remain regarding the black box version of the quantum collision 
problem: Can the running time be reduced? What if both input and output registers are in 
unknown bases? Is the inverse oracle essential? And is black box rank estimation really no 
easier than rank estimation by sampling? 
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